Hardware Performance Monitoring in Multiprocessors
نویسندگان
چکیده
Multiprocessors are often quoted as being capable of a ‘peak performance,’ but in practise it is difficult to utilize this potential. Consequently, software applications must be well tuned to run efficiently. In this thesis, factors affecting performance in cache-coherent multiprocessors, particularly those which use a sequentially-consistent memory model, are analyzed. Cost-effective hardware features are suggested to measure these factors without intruding on software performance. Many of the proposed features can be readily incorporated into future processor designs, and others are easy to implement in external hardware. The measurements can be used to help tune a program’s performance as well as other purposes such as simulation verification, workload characterization, or runtime performance decision support. To demonstrate cost feasibility and other implementation concerns, the processor card hardware performance monitor developed for the NUMAchine multiprocessor is also described.
منابع مشابه
A Survey of Cache Coherence Mechanisms in Shared Memory Multiprocessors
This paper is a survey of cache coherence mechanisms in shared memory multiprocessors. Cache coherence is important to insure consistency and performance in scalable multiprocessors. A variety of hardware and software protocols have been proposed and investigated in simulation studies. Our contribution in this work is an in-depth survey of hardware and software protocols including recent work. ...
متن کاملHardware Prefetching in Bus-Based Multiprocessors: Pattern Characterization and Cost-Effective Hardware
Data prefetching has been widely studied as a technique to hide memory access latency in multiprocessors. Most recent research on hardware prefetching focuses either on uniprocessors, or on distributed shared memory (DSM) and other non bus-based organizations. However, in the context of bus-based SMPs, prefetching poses a number of problems related to the lack of scalability and limited bus ban...
متن کاملRunning Parallel Applications on an Mp with Multithreaded Superscalar Processors Running Parallel Applications on a Mp with Multithreaded Superscalar Processors
With lesser returns on adding more complexity to conventional superscalar processors, simultaneous multithreaded (SMT) superscalar processors seem to be a promising alternative. Unfortunately, most previous work has focused on systems running multiprogrammed loads of sequential applications. It is not clear how well these processors work in a shared-memory multiprocessor environment running par...
متن کاملDevelopment of hardware system using temperature and vibration maintenance models integration concepts for conventional machines monitoring: a case study
This article describes the integration of temperature and vibration models for maintenance monitoring of conventional machinery parts in which their optimal and best functionalities are affected by abnormal changes in temperature and vibration values thereby resulting in machine failures, machines breakdown, poor quality of products, inability to meeting customers’ demand, poor inventory contro...
متن کاملA comparative evaluation of hardware-only and software-only directory protocols in shared-memory multiprocessors
The hardware complexity of hardware-only directory protocols in shared-memory multiprocessors has motivated many researchers to emulate directory management by software handlers executed on the compute processors, called software-only directory protocols. In this paper, we evaluate the performance and design trade-offs between these two approaches in the same architectural simulation framework ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996